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SYSTEM AND METHOD FOR PROGRAMMATIC 
GENERATION OF CONTINUOUS MEDIA PRESENTATIONS 

FIELD OF THE INVENTION 
This invention relates to the field of composition and presentation of multimedia. 
More specifically, the invention relates to the programmatic generation of multimedia presentation 
sequences. 

BACKGROUND OF THE INVENTION 
A block diagram of a conventional distributed multimedia presentation 
environment is shown in Fig. 1 . A multimedia system in such an environment can produce a 
composed multimedia document (not shown) which is displayed on a presentation device 1 1, such 
as a television or computer monitor. First, various presentation documents 16, 17 are pre- 
composed by authors. This task of generating the presentation documents is called authoring. 
Many different types of documents can be authored, e.g., composition of video presentation 16 or 
composition of Web pages 17. Once these documents are prepared, they are stored in various 
data storage devices 14, 15 such as hard disks, digital video disks or storage devices of a satellite 
broadcasting company. Upon a user's request for presentation of a document, the presentation 
documents are delivered through the network 13 to the presentation device 1 1 for presentation. 
This is referred to as the "pull" mode. The presentation can also be sent by the storage devices 
14, 15 pro-actively (e.g., by the broadcasting companies) to users without any explicit request via 
the network 13 or otherwise. This is referred to as the "push" mode. 
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Therefore, conventionally, multimedia presentation materials are generated before 
the presentation, i.e., at an authoring time. Once generated, presentation is accomplished exactly 
as the presentation materials are pre-composed. The user's capability to interact with the 
presentation is limited through interaction with the control panels 12 on the presentation device 
11. Typical examples of the possible interactions provided by these control panels 12 include 
selection of different materials (channels or URLs), fast forward, fast reverse, pause, etc. 
However, in prior art systems, no dynamic changes to the presentation materials are supported. 

A somewhat enhanced interaction capability is provided by the 3D object model. 
A pre-specified interaction semantic is built into the object. For example, PanoramIX, an image- 
based rendering software from IBM (www.software.ibm.com/net.media), uses environment maps 
(e.g., cylinders with tops/bottoms or polyhedral approximations of spheres) to render the 
background from a fixed viewpoint that does not translate. It supports smooth rotation and 
continuous zoom of the view. PanoramIX also allows pre-composition of a complex scene with 
embedded sprites, audio segments, video clips and 3D objects. It uses a control file that is pre- 
defined during scene authoring, using a special authoring tool. 

Another example of a prior art system that uses pre-composed complex scenes is 
contained in the specifications of the MPEG-4 standard. MPEG-4 employs a BIFS (Binary 
Information for Scenes) composite file that describes a scene hierarchy, along the lines of VRML 
(Virtual Reality Modelling Language) scene graphs. Leaf nodes of the graph are AVOs (Audio- 
Visual Objects) that could be a video or audio or a 3D object or any other component media type. 
In fact, PanoramIX may be viewed as one instance of the MPEG-4 specification. 
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While they do provide some added interaction capability, these systems still fall 
short of providing the full ability to dynamically alter presentations after authoring. 

SUMMARY OF THE INVENTION 
The present invention provides a method by which a user can programmatically 
5 compose presentation materials which have been authored. First, users compose different 
presentations by applying dynamic synchronization, placement and selection of media stream 
segments under the control of a program. In contrast to pre-composed presentations where the 
* 3 selection of media segments and the synchronization and placement of those media segments are 
''• : 4 predetermined (e.g., MPEG-2 video), in the present invention, the selection, synchronization (for 
1@0 example, the determination of starting times for each media segment and delay insertion) and 
; ~ placements (in the display window) are controlled dynamically through the composition program. 

: j Second, the composition programs themselves are rule-based and event-driven, and therefore can 

l.y 

□ be dynamically altered. Third, the program that controls composition may be downloaded or pre- 
110 resident at the presentation site. Additionally, various commands (understood by this program) 
15 may also be streamed to the program via separate logical channel(s). Finally, the composition can 
be accomplished both at the client site or at the server site based on user interactions at the client 
as well as network and server load. 

Specifically, a method of dynamically generating a presentation sequence from a 
plurality of authored presentation documents is provided including the steps of receiving the 
20 plurality of authored presentation documents from a plurality of data sources, applying the 
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plurality of authored presentation documents to a set of presentation rules, and generating the 
presentation sequence in response to the applying step. 

Preferably, the applying step includes the steps of testing for satisfied rule 
conditions and applying the plurality of authored presentation documents to a set of presentation 
rules in response to the satisfied rule conditions. In addition, it is preferable that the method 
include the step of receiving user input and wherein the generating step includes the step of 
generating the presentation sequence in response to the received user input. The method 
preferably includes the further step of modifying the set of presentation rules in response to the 
received user input. 

The method can further include the step of sensing an external event, wherein the 
generating step includes the step of generating the presentation sequence in response to the 
sensed external event. Additionally, the method can further include the step of modifying the set 
of presentation rules in response to the sensed external even or in response to the received 
presentation documents. 

The method can further include the steps of receiving meta data from the data 
sources and modifying the set of presentation rules in response to the received meta data. 

The authored presentation documents are preferably authored presentation 
sequences and the generated presentation sequence is preferably a composite presentation 
sequence. 

A method for programmatic generation of continuous multimedia presentations by 
a station capable of receiving at least one presentation and sensed events is also provided 
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including the steps of maintaining a library of rules, receiving at least one presentation; selecting at 
least one event to be sensed; receiving the event, testing each rule in the library for each received 
event; and optionally applying each rule to the presentation for each received event in response to 
the testing step to modify the presentation or to generate a new presentation. 

5 A method for dynamically composing a presentation from a plurality of multimedia 

components is also provided including the steps of selecting one or more of the multimedia 
components to be identified as an initial portion of the presentation; programmatically selecting 
one or more other multimedia components to be identified as a subsequent portion of the 

= 3 presentation; disposing the subsequent portion with or following the initial portion; and 
104 synchronizing the selected components to form the presentation. 

: ''i Preferably, this method further includes the step of presenting the presentation. 

' v The multimedia components can be continuous media components or non-continuous media 

I'.j. 

I; ; components. They can also be audio or video components or text or image components. 
P It is preferable that the programmatically selecting step is responsive to input 

15 parameters, a past presentation history or a current state and to line content or meta-data. 

A program storage device, readable by a machine, tangibly embodying a program 
of instructions executable by the machine to perform method steps for dynamically generating a 
presentation sequence according to the above-referenced method steps is also provided. 

A programmatic presentation generator into which presentation documents are 
20 received is provided, including a rule base; a program state; and a computation engine, wherein 
the engine receives rule data from the rule base and state data from the program state and 
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generates a presentation sequence responsive to the presentation documents, the rule data and 
state data. 

A set top box for receiving channels and sending presentation sequences to a 
digital television is also provided, including the programmatic presentation generator of the 
present invention. 

Finally, a television set for receiving channels and generating presentation 
sequences is provided including the programmatic presentation generator of the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Further objects, features and advantages of the present invention will become 

apparent from a consideration of the following detailed description of the invention when read in 

conjunction with the drawing figures, in which: 

Fig. 1 is a block diagram of a conventional distributed multimedia presentation 

environment; 

Fig. 1 A is a block diagram of a presentation environment incorporating the 
programmatic presentation generator of the present invention; 

Fig. IB is a block diagram which shows one embodiment of the programmatic 
presentation generator of Fig. 1 A incorporated as part of a set-top box; 

Fig. 1C is a block diagram showing an Intelligent TV Guide example of using the 
programmatic presentation generator of the present invention; 

Fig. ID is a block diagram showing an intelligent slide show example of using the 
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programmatic presentation generator of the present invention; 

Fig. 2 is a block diagram of the presentation environment of Fig. 1 A showing the 
framework of the programmatic presentation generator according to the present invention; 

Fig. 3 is a block diagram of a rule of the programmatic presentation generator 
framework of the present invention; 

Fig. 4 is a flowchart of the process to generate a presentation sequence according 
to the present invention; 

Fig. 5 is a block diagram of a conventional presentation sequence; 

Fig. 6 is a block diagram of a conventional static object; 

Fig. 7 is a block diagram of a composite presentation sequence of the present 

invention; 

Fig. 8 and Fig. 8 A are block diagrams showing the operations to define and change 
the temporal properties of a presentation sequence according to the present invention; and 

Fig. 9, Fig. 9A and Fig. 9B are block diagrams for the operations to define and 
change the temporal properties of a composite presentation sequence using multiple presentation 
sequences according to the present invention. 



DETAILED DESCRIPTION OF THE INVENTION 
Figure 1 A is a block diagram of a distributed multimedia presentation environment 
utilizing the programmatic presentation generator of the present invention. In Fig. 1 A, a 
presentation device 1 Al, such as a TV or computer monitor, presents a presentation sequence 
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1 A5 generated by the programmatic presentation generator 1 A4. This programmatic presentation 
generator 1 A4 can generate the presentation sequences using the data pushed from data source 
1 A3 or it can pull the data from the data source 1 A2 when required. These data sources can 
include a hard disk, digital video disk, satellite or TV broadcasting company. The programmatic 
presentation generator 1 A4 according to the present invention permits a presentation sequence, 
such as TV programs, MPEG sequences, etc., to be dynamically generated or, once generated, to 
be dynamically modified reflecting user input, presentation rules and external events. 

This programmatic presentation generator 1 A4 could be incorporated on different 
software components. For example, it can be incorporated as part of the set-top box environment 
or executed as part of the server software. 

Fig. IB shows the block diagram of a programmatic presentation generator 1B5 
used in a set-top box 1B2. A television 1B1 receives the presentation sequences from a set-top 
box 1B2. Alternately, one skilled in the art can appreciate that the programmatic presentation 
generator can be located in the television 1B1 itself, assuming the functionality of the set-top box 
1B2 is incorporated therein. In prior art systems, users could only select a channel of interest out 
of the multiple incoming channels 1B4. However, by placing the programmatic presentation 
generator 1B5 in a set-top box 1B2 (or in a television), users can dynamically modify or compose 
a presentation sequence using the multiple incoming channels 1B4. The components of a 
programmatic presentation generator 1B5 (i.e, rule base, engine, program state) are described 
hereinbelow. 

Fig. 1C is a block diagram which shows an Intelligent TV Guide as an example 
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using the programmatic presentation generator of the present invention. Multiple channels 1C1 
are connected to a TV 1C3. Through certain channels 1C2, additional meta (command) data can 
also be transmitted. Using the programmatic presentation generator of the present invention, a 
user can watch the programs in which he is most interested. This selection could be specified 
using rule-based programs as described hereinbelow. This program could use simple keyword 
matching, similar to that used in an internet search engine, over the meta data. For example, a 
program could be added to the rule-base (described hereinbelow) so that automatic selection of 
the contents is executed every 30 minutes. As an example, the rules can be specified in the 
following way: First, repeated selection of the same content should be avoided. Second, violent 
programs may not be selected. Also, if a new interesting program shows up on the display and if 
it is within the threshold of preference (i.e., evaluated via a user-defined function as sufficiently 
interesting), the next two best programs are shown in the top corner windows 1C4, 1C5 for 2 
minutes. This functionality significantly assists a user with the task of choosing a program to 
view. 

Fig. ID is a block diagram which shows an Intelligent Slide Show as another 
example of an application of the programmatic presentation generator of the present invention. 
The composed, original slide show 1D1 consists of 4 slides (1D21 through 1D24). Two of the 
slides (1D21 and 1D23) include URLs. The slide show 1D1 was designed to be played back in 
the regular playback order shown in 1D2, i.e., in the order of 1D21, 1D22, 1D23 and 1D24. A 
different slide show can be generated using the programmatic presentation generator of the 
present invention as shown in 1D3. The new (extended playback) slide show 1D3 has been 
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generated by extending 1D1. In the slide show 1D3, the contents of the URLs included in the 
original slides (1D21 and 1D23) are retrieved and included as parts of the slides. The inclusion of 
the pages referenced by the URLs can be accomplished in various ways. For example, the slide 
1D21 was extended such that the referenced URL page 1D3 12 is included as a part of the same 
slide as page 1D3 1 1 as shown in 1D3 1. In contrast, the referenced URL page on slide 1D23 is 
included as a separate slide 1D34. 

Fig. 2 is a diagram which shows a more detailed view of the programmatic 
presentation generator according to the present invention along with a distributed multimedia 
presentation environment. Data sources 21, 22, 23, 24 and network 25 are the same as those in 
Fig. 1 . The framework 26 of the programmatic presentation generator is composed of a rule-base 
261, a presentation state 262 and a computation engine 263. 

The rule-base 261 is composed of a set of rules to be described hereinbelow. The 
rule-base 261 can be modified (i.e., adding, deleting or modifying one or more rules) in several 
ways. First, modification can occur by local user input. Second, one or more existing rules can 
implicitly modify the rule-base. Finally, data contained in the presentation sequence(s) received 
from the data source(s) or meta data received from the data source(s) in a separate logical channel 
(whether initiated by a user pull command or by a data source push command) can alter the rule- 
base. 

The presentation state 262 consists of a presentation history and a set of state 
variables. The presentation history includes the history of such information as presentation 
contents, user interactions and external events up to a certain point in time. A state variable is a 
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program-defined variable which serves to mark and represent the status of a presentation so that it 
can be referred within a program (rules). 

The computation engine 263 includes computational logic which monitors the 
presentation state 262, interprets the rule base 261 and sensed external events, and modifies the 
presentation sequence, thereby generating a new presentation sequence 264 and a resulting 
presentation 27. The sensed external events can include input timing parameters and user inputs. 
Upon each sensed external event, the engine 263 serializes the rules in the rule-base 261, and 
applies each rule to the current presentation sequence. That is, given a rule-base RB = {R u R 2 ,. . 
.jRn}, where Rj is a rule, and the current presentation sequence P, the engine generates the new 
presentation sequence P n as follows: 

P={R 1 }=>P 1 ={R 2 }=>P 2 ={R 3 }=>P 3 ={R n }=>P n 
where P { is the presentation sequence generated from the presentation sequence P^ by applying 
rule Rj. 

Fig. 3 shows a block diagram of a rule within the framework of the programmatic 
presentation generator of the present invention. A rule is a pair of TEST 3 1 and ACTION 32. 
TEST 3 1 specifies the condition 3 1 1 to be met for the ACTION to be fired. It can be any 
predicate over the presentation states such as "if at time t i5 a specific image has been presented." 

ACTION 32 is a sequence of operations on a presentation sequence to generate a 
new presentation. These operations can be realized by any code segments using any programming 
language such as C, PASCAL, JAVA, etc. The operations in an ACTION are preferably 
composed of four parts 321, 322, 323, 324. The first code segment 321 is to specify the selection 
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of different content data to be included in the presentation. The second is the code segment 322 
which specifies different temporal orders of presentation. This code segment uses the operations 
which will be described in Figs. 8, 8A, 9, 9A and 9B. The third code segment 323 is to specify 
the different spatial layouts among different presentation content data. Lastly, there can be code 
segments 324 for changing presentation attributes, e.g., quality, color, etc. In addition, code 
segments to change the presentation states 325 (e.g., by utilizing state variables) can be 
interleaved with the above-mentioned operations. Here, annotations in the input presentation can 
be monitored and state variables can be adjusted accordingly. 

In the application of a rule 1^ = <TEST,ACTION> to a presentation sequence P, 
first, it is tested if the condition TEST is met and if so, the code segment ACTION is applied to 
the presentation sequence P. More mathematically, when the condition TEST is satisfied (is 
TRUE), an ACTION can be specified as P old =ACTION=->P new where P old is the current 
presentation sequence and P new is the new presentation sequence. 

Fig. 4 is a flowchart illustrating a preferred method for the computation engine 263 
to generate a new presentation sequence upon detection of an event. In the diagram, the current 
presentation is represented by P, the rule-base by {R 2 , R 2 , . . ., RJ, each rule in the rule-base by 
Ri, the number of rules in the rule-base by n, and the new presentation generated at each 
computation step by P ncw . In step 41, a new event is sensed and the number of rules, n, in the rule 
base is received. Next, the engine initializes the index variable i to 0 and initializes the new 
presentation sequence P ncw to the current presentation sequence P in step 42. Then, in step 43, 
the index variable i is increased by 1. In step 44, the index variable i is compared to the number of 
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rules, i.e., n, in the rule-base. If the index variable is greater than the number of rules, the current 
P ncw is output as a new current presentation by setting P = P ncw . Otherwise, in step 45, the rth 
rule, Ri is selected and is tested. If the TEST of Rj is not TRUE, the method proceeds in step 43. 
Otherwise, if the test of Ri is TRUE, in step 46, the ACTION of Ri is applied to P ncw to update 
P new . Then, the method continues with step 43. 

Fig. 5 shows a block diagram of a presentation sequence D. The presentation 
sequence D is composed from a sequence of static objects 51, 52, 53, 54. . ., etc. and then by 
assigning the relative time t k to each static object 51, 52, 53, 54 . . etc. in the presentation 
sequence. A static object is a presentation object without any temporal property. Examples of 
the static objects include an image, text, a video frame, etc. 

Fig. 6 shows a block diagram of a composite static object. A composite static 
object is composed by spatially relating multiple composite objects 61 and 62 on a screen. The 
description of the spatial relation can be accomplished by using techniques known in the prior art. 
One such technique can be found in IBM Research Report, Hyperstory, M. Kim and J. Song, 
1995. 

Fig. 7 shows a block diagram of a composite presentation sequence. A composite 
presentation sequence consists of two or more presentation sequences. The composite 
presentation sequence in Fig. 7 consists of three presentation sequences Dl, D2 and D3. 

The operation to create a presentation sequence can be represented by Create, i.e., 
Create (< ( Sl , 0, fe, t 2 ), . . ., (s n , tj >). 
Given a sequence of static objects, s l5 s^. . .,s m a presentation sequence < (s^ tj, fo, t 2 ), . . (s m 
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> is created in which static object s { is displayed during virtual time interval (t b tj +1 ). For 
example, in Fig. 5, the static object 51 is shown during the virtual time interval (t l9 1 2 ), and static 
object 52 is shown during virtual time interval (t 2 , t 3 ), etc. 

A presentation sequence can also be generated from one or more existing 
presentation sequences using temporal operators. A set of temporal operations for the description 
of temporal property can also be found in "Composition and Search with Video Algebra" by Ron 
Weiss, Andrej Duda and David K. Gifford in IEEE Multimedia, 1995. 

Fig. 8 and Fig. 8A illustrate different operations on a presentation sequence 
according to the present invention. The intra-sequence operations include: 

Head (D,i) 81: Given a presentation sequence D = <(s b tj), (s^ t 2 ), . . (s n , O >, 
a subsequence ending at the z'th object is selected as a new presentation sequence. That is, D ncw = 
<(Si, tj, (a* t 2 ),. . ., ( Si , 0 >. 

Tail (D,i) 82: Given a presentation sequence D = < (s l5 t x ) 9 (s^ t 2 ),. . O a 
subsequence starting from the /th object is selected as a new presentation sequence. That is, D ncw 
= < (s i5 ti), (s i+1 , t i+1 ),- • .,(Mn) >■ 

Delay (D,t) 83: Given a presentation sequence D = < (s^ t^^Sj, t 2 ),. . .(s n , O >, a 
new presentation sequence D new is created in which the virtual display times are delayed by t 
virtual time units. That is, D new = < (s l5 t x + 1), (s* t 2 +t),. . (s n , t n + 1 ) > 

Stretch (D,r) 84: Given a presentation sequence D = <(s!, t^, (s^ t 2 ),. . (s n , tj >, 
a new presentation sequence D new is created in which the starting times t 4 of static object s { is 
scaled to t { x r, i.e., D ncw = <(s lf t x x r), (s* t 2 x r), . . .,(s n , t n x r)>. 
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Repeat (D,i,j,r) 85: Given a presentation sequence D, a new presentation sequence 
is created in which the subsequence from s { to s j is repeated r times. 

A composite presentation sequence also can be generated by temporally relating 
multiple presentation sequences (i.e., via inter-sequence operations) according to the present 
invention. Fig. 9, Fig. 9 A, and Fig. 9B show the block diagrams for such inter-sequence temporal 
operations. They include: 

Meet (D u D 2 ) 91 : Given two presentation sequences D x and D 2 , a new 
presentation sequence D ncw is created in which the starting time of D 2 is the same as the ending 
time of D v 

Co-Start (D l5 D 2 ) 92: Given two presentation sequence D x and D 2 , a new 
presentation sequence D ncw is created in which the starting time of D x and D 2 are the same. 

Co-End (D 1? D 2 ) 93: Given two presentation sequences D x and D 2 , a new 
presentation sequence D new is created in which the ending time of T> x and D 2 are the same. 

Interleave (D l3 D 2 , d) 96: Given two presentation sequences, D r and D 2 , a new 
presentation sequence is created in which the two sequences are interleaved. Here the parameter 
d represents the delay in the new presentation sequence of the start time of D 2 relative to the start 
time of the presentation sequence D x . 

Given a presentation sequence, a presentation is constructed by associating a start 
time and a play rate. More mathematically, P = <D, t s , r> is a presentation where the presentation 
sequence D is started at real time S with play rate r. Here the start time t 8 and the play rate r are 
used to map virtual times to real times. For example, virtual time t; is mapped to real time 
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t. + t i xr. 

Thus, an important feature of the framework of a programmatic generation of a 
presentation according to the present invention is that the rule-base can be dynamically changed 
by inserting/deleting/modifying rules and the changes can be dynamically reflected in the 
generation of a presentation with the engine. 

The framework of the programmatic generation of the presentation sequence 
according to the present invention can be made in either the multimedia server or in the client 
presentation system or both. 

Now that the invention has been described by way of a preferred embodiment, 
various modifications and improvements will occur to those of skill in the art. Thus, it should be 
understood that the preferred embodiment is provided as an example and not as a limitation. The 
scope of the invention is defined by the appended claims. 
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